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© Expression of HTLV-III gag-Gene. 



© A recombinant gag-protein of the etiologic agent 
of acquired immune deficiency syndrome (AIDS) and 
the proteolytic proteins produced therefrom as well 
as corresponding vectors and transformants express- 
ing these proteins are disclosed. In addition, a meth- 
^ od of testing human blood for presence of antibodies 
to the AIDS virus using the recombinant gag-protein 
|Nor any of its proteolytic proteins or mixtures thereof 
C^is disclosed. 
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Expression of HTLV-HI gag-Gene 



Background of the Invention 

The retrovirus HTLV-III and the closely related 
variants of this virus. LAV and ARV, appear to be 
the causative agents of the disease Acquired Im- 
munodeficiency Syndrome (AIDS) [see Barrg- 
Sinoussi et al., Science 220, 868-87! (I983); Levy et 
al., Science 225, 840-842 (I984); Montagnier et al., 
in Human T-Cell Leukemia/Lymphoma Virus, R. C. 
Qallo, M. Essex and L Gross, eds. (Cold Spring 
Harbor, New York: Cold Spring Harbor Laboratory) 
pp. 363-370 (1984); Popovic et al., Science 224 . 
497-500 (1984): Gallo et al., Science, 224 500-503 
(1984); SchOpbach et al., Science 224, 503-505 - 

(1984) ]. There is strong correlation between AIDS 
and the presence of antibodies to HTLV-III. Further- 
more, 85-95% of patients with lymphadenopathy 
syndrome and a significant proportion of asymp- 
tomatic homosexual men in AIDS endemic areas 
carry circulating antibodies to HTLV-III [see 
Schupbach et al., supra]. HTLV-III antibodies have 
also been widely detected in patients who were 
exposed to this disease through intravenous drug 
injections with contaminated needles and in 
hemophiliacs who received intravenous blood pro- 
ducts. Current estimates indicate that approximate- 
ly one million Americans have been infected with 
this virus, and approximately 10% of this infected 
population is expected to acquire this lethal dis- 
ease. 

Molecular cloning and nucleotide sequence 
analysis of HTLV-III and its variants have dem- 
onstrated that this viral genome exhibits many of 
the structural features of the avian and mammalian 
w retroviruses [see Ratner et al., Nature 313, 277-284 

(1985) : Sanchez-Pescador et al., Science 227, 484- 
492 (1985); Wain-Hobson, et al., Cell 40, 9-17 (1985); 
and Muesing, M. et al., Nature 313, 450-458 (1985)]. 
Thus, the viral genome contains the three genes - 
(gag, pol and env) characteristic of ail retroviruses. 
In addition, the HTLV-III genome contains two short 
open reading frames whose function are unknown. 

Effective containment of AIDS depends on de- 
velopment of sensitive and rapid methods to iden- 
tify individuals exposed to or infected with HTLV-III 
and therapeutic agents that interfere with viral repli- 
cation. One of the viral genes, gag, encodes a 
precursor which is proteolytically processed into 
core proteins during virion maturation. From DNA 
sequence data and analysis of isolated viral pro- 
teins it follows that the HTLV-III gag precursor 
comprises about 56 kd and is processed into spe- 
cies of approximately 24, 16, and 14 kd (Ratner et 
al., supra; Sanchez-Pescador et al., supra; Wain- 



Hobson et al., supra; and Muesing et al., supra). 
The protease responsible for this processing is 
typically encoded by the retroviral genome. It is 
included in the 3' end of the gag gene in avian 

5 retroviruses and in the 5' end of the pol gene in 
mammalian viruses [for review see Dickson et al., 
"Protein Biosynthesis and Assembly", in Molecular 
Biology of Tumor Viruses, R. A. Weiss, N. M. 
Teich, H. E. Varmus and J. M. Coffin, eds. (Cold 

70 Spring Harbor Laboratory: Cold Spring Harbor, 
NY) pp. 513-648 (1982)]. In at least one mammalian 
retrovirus, Moloney murine leukemia virus (MuLV), 
the protease is a gag-pol read-through product. A 
therapeutic agent that could inhibit this protease 

15 might block virus spread. It is, therefore, important 
to identify the region of the HTLV-III genome that 
encodes this protease and to develop an in vitro 
system in which the proteolysis of the gag gene 
precursor can be studied. 

20 HTLV-HI genomes have been molecularly clon- 
ed. Shaw et al., Science 226, II65-II7I (1984). Also 
the complete nucleotide sequence of the proviral 
genome of HTLV ill has been determined [Ratner 
et al., supra: and Sanchez-Pescador, et al., supra]. 

25 One reason for the difficulty in determining the 
etiologic agent of AIDS was due to the reactivity of 
various retroviral antigens with serum samples from 
AIDS Patients. For example, serum samples from 
AIDS patients have been shown to react with anti- 

30 gens of HTLV I and HTLV III (HTLV-I: Essex et al., 
"Antibodies to Cell Membrane Antigens Associated 
with Human T-Cell Leukemia Virus in Patients with 
AIDS", Science 220, 859-862 (1983); HTLV-III: Sam- 
gadharan et al., "Antibodies Reactive With Human 

35 T-Lymphotropic Retroviruses (HTLV-III) in the Se- 
rum of Patients With AIDS", Science 224, 506-508 
(1984)). Gene products of HTLV demonstrated an- 
tigenicities cross-reactive with antibodies in sera 
from adult T-cell leukemia patients [Wyokawa, T. et 

40 al., "Envelope proteins of human T-cell leukemia 
virus: Expression in Escherichia coli and its ap- 
plication to studies of env gene functions", PNAS - 
(USA) §1, 6202-6206 (1984)]. Adult T-cell leukemias 
(ATL) differ from acquired immune deficiency syn- 

45 drome (AIDS) in that HTLV-I causes T-cell malig- 
nancies, that is uncontrolled growth of T-cell. In 
AIDS rather than cell growth there is cell death. In 
fact this cytopathic characteristic of HTLV III was 
critical to determining ultimately the specific retro- 

50 viral origin of the disease. Thus the etiologic agent 
of AIDS was isolated by use of immortalized hu- 
man neoplastic T-cell lines (HT) infected with the 
cytopathic retrovirus characteristic of AIDS, isolated 
from AIDS afflicted patients. Seroepidemiological 
assays using this virus showed a complete cbrrela- 
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tion between AIDS and the presence of antibodies 
to HTLV III antigens [Samgadharan et al. P supra - 
(1984); SchQpbach et ai., supra]. In addition, nearly 
85% of patients with lymphadenopathy syndrome 
and a significant proportion of asymptomatic ho- 
mosexual men in AIDS endemic areas were also 
found to carry circulating antibodies to HTLV III. 
Taken together, ail these data indicate HTLV 111 to 
be the etiologic agent for AIDS. 

Until the successful culturing of AIDS virus 
using H-9 cell line the gag AIDS protein of the 
AIDS virus had not been isolated, characterized or 
synthesized. This in major part is due to the fact 
that the virus is cytopathic and thus isolation of the 
virus was not possible [Popovic, M. et ai., supra]. 
Once the human T-cell line resistant to the 
cytopathic effects of the vims was discovered, a 
molecular clone of proviral DMA could be achieved. 

The need for a sensitive and rapid method for 
the diagnosis of AIDS in human blood and its 
prevention by vaccination is very great. Virtually all 
the assays/tests presently available are fraught with 
errors, in fact the Center for Disease Control - 
(CDC) has indicated that presently available tests 
be used solely for screening units of blood for 
antibody to HTLV III. The CDC went further by 
stating that the presently available ELISA tests can 
not be used for general screening of high risk 
populations or as a diagnostic test for AIDS 
[Federal Register 50(48), 9909. March 12, 1985]. 
The errors have been traced to the failure to use a 
specific antigenic protein of the etiologic agent for 
AIDS. The previously used proteins were derived 
from a viral lysate. Since the lysate is made from 
human cells infected with the virus, i.e. the cells 
used to grow the virus, the lysate will contain 
human proteins as well as viral proteins. Thus 
preparation of a pure antigen of viral protein is very 
difficult The antigen used produced both false 
positive and false negative results [Budiansky. S. r 
AIDS Screening, False Test Results Raise Doubts, 
Nature 3]2. 5830984)]. The errors caused by the 
use of such lysate proteins/peptides can be avoid- 
ed by using a composition for binding AIDS anti- 
bodies which is substantially free of the non-AlDS 
specific proteins. Compositions that are substan- 
tially pure AIDS gag-proteins can be used as anti- 
gens. 



Summary of Invention 

In accordance with this invention we have pro- 
duced by recombinant technology the im- 
munologically active portion of the precursor of the 
HTLV-III gag protein as well as the immunological 
active portion of the natural proteolytic precusor 
proteins which result from this gag protein - 



(hereinafter also referred to as polypeptides im- 
munologically equivalent to the gag-protein pro- 
ducts of HTLV-III). In addition, we have produced a 
recombinant organism capable of expressing these 

5 proteins/polypeptides by utilizing various expres- 
sion vectors capable of expressing all of these 
proteins. By recombinant technology in accordance 
with this invention, one produces the precursor gag 
56 kd protein with modification resulting from the 

to removal of N-terminal codons at its amino termi- 
nus, but having the same immunological activity as 
the natural precursor gag protein. In view of this 
modification in the precusor, the 14 kd gag protein 
which is proteolytically produced therefrom is also 

75 modified at its amino terminus from the natural 14 
kd gag protein. However, this modified gag protein 
also has the same immunological activity as its 
natural form. Also in accordance with this invention, 
a new p48 protein is produced having the same 

20 immunological activity of the natural precursor gag 
protein. The 16 kd proteolytic protein produced by 
the process of this invention has a variant in its 
amino acid structure from the proteolytic 16 kd gag 
proteins reported in Shaw et al M supra. The 24 kd 

25 proteolytic protein is the same as the naturally 
occuring 24 kd proteolytic protein. 

The 56 kd, 24, 16 and 14 kd gag proteins 
. produced by this invention have the same im- 
munological activity as their corresponding natural 

30 gag proteins. They have the same epitopes to 
react with the same antibodies as their correspond- 
ing natural proteins. 

In accordance with this invention, recombinant 
DNA techniques are utilized for producing the 

35 HTLV-III gag protein having 56 kd as well as the 
proteolytic proteins produced therefrom, i.e. the 
proteins having 24,16 and 14 kd, respectively. In the 
first step of this invention, one isolates from the 
known genome for the retrovirus HTLV-III a portion 

40 which contains the DNA sequence encoding for the 
gag protein of HTLV-III and this portion is con- 
structed into a gene which contains this DNA se- 
quence encoding for the gag protein of HLTV-IU 
operably linked to a promoter capable of effecting 

45 the expression of said DNA sequence and this 
gene is inserted into an expression vector or plas- 
mid and such vector or plasmid is inserted into a 
suitable microorganism, preferably a yeast cell, to 
produce a microorganism capable of expressing 

50 the active portion of the gag-protein of HTLV-III. In 
accordance with this invention, the recombinant 
organism not only expresses a protein which is 
immunological equivalent to the natural precusor 
gag protein, but also expresses the species pro- 

55 teins which are processed by proteolytic enzymes 
expressed with the precusor protein. 
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Also included in this invention are those amino 
acid substitution in the sequence of the 56, 48, 24, 
16 and 14 kd gag proteins produced through muta- 
tion of the recombinant microorganisms of this 
invention. These amino acids substitutions in the 
sequence of these proteins produce modified pro- 
teins which are immunologically equivalent to the 
proteins of which they are modifications. These 
amino acid substitutions are described in the art - 
[H. Neurath and R.L. Hill "The Proteins", Academic 
Press, New York (1979)] in particular in fig. 6 of 
page 14. The most frequently observed amino acid 
substitutions are Aia/Ser, Val/lle, Asp/Qlu, Thr/Ser, 
Ala/Gly, Ala/Thr, Ser/Asn, Ala/Val, Ser/Gly, Tyr/Phe, 
Aia/Pro. Lys/Arg, Asp/Asn, Leu/lle, LeuA/al, Ala/Glu, 
Asp/Giy, and vice versa. 

A further aspect of this invention relates to a 
diagnostic method for testing human blood for the 
presence of antibodies to the gag protein and its 
proteolytic proteins. This aspect of the invention 
overcomes the problems of previously used blood 
tests for AIDS. One of the problems in detecting in 
vitro the AIDS virus is to provide a composition 
which does not contain proteins or peptides which 
are not derived solely from the AIDS etioiogic 
agent. A composition using either the active portion 
of gag-protein or its various proteolytically derived 
proteins overcomes the nonspecificity of the prior 
tests or assays. Yet another aspect of this invention 
is a diagnostic method for detecting and/or deter- 
mining the presence of the antigen in human blood. 

Another aspect of this invention is to use either 
the recombinant gag-protein or its proteolytically 
derived proteins as antigens in providing antibodies 
which are active in detecting AIDS in samples of 
body fluid. 

Just another aspect of this invention is to use 
either the recombinant gag-protein or its prot- 
eolytically derived proteins as a vaccine capable of 
inducing protective immunity against the AIDS 
virus. Routes of administration, antigen doses, 
number and frequency of injections will vary from 
individual to individual and may parallel those cur- 
rently being used in providing immunity in other 
viral infections. The vaccines can be prepared in 
accordance with known methods. The vaccine 
compositions will be conveniently combined with 
physiologically acceptable carrier materials. The 
vaccine compositions may contain adjuvants or any 
other enhancer of immune response. Furthermore, 
the vaccine compositions may comprise other anti- 
gens to provide immunity against other diseases in 
addition to AIDS. 

The methods for testing human blood for the 
presence of AIDS virus or of antibodies against 
AIDS virus can be conducted in suitable test kits 
comprising in a container a recombinant gag-pro- 



tein or its proteolytically derived proteins of the 
present Invention or antibodies against AIDS virus 
elicited by these proteins of the present invention. 

5 

Brief Description fit M Drawings. 

Fig. I-A illustrates the restriction sites in X 
HXB-3, a known gene clone for the HTLV-HI virus, 

ro with the gag region of this gene expanded to show 
the precursor (p 56) and its naturally proteolytically 
produced proteins (p 24, p 16, and p 14) as well as 
its mutant protein (p 48). 

Fig. hB illustrates the construction of a gene 

75 containing the gag-gene for HTLV-HI and a pro- 
moter for later insertion into a plasmid. 

Rg. 2 is an immunoblot analysis of yeast 
lysates obtained from yeast grown with pYE72/gag 
I. Column I is the reading taken from ceils grown in 

20 a high phosphate medium and column 2 is the 
reading from cells grown in a phosphate-free me- 
dium. The indicated molecular weight markers in- 
dicate the sizes of the respective bands. 

Figure 3 is an autoradiography reading of an 

25 SDS gel at various times after immunoprecipitated 
lysates produced from yeast cells containing the 
gag plasmid pYE72/gag I grown with *S- 
methionine. Columns A through G in Fig. 3 repre- 
sent different chase times. Column H represents 

30 results from lysates from yeast cells containing 
pYE 72/gag I grown with a P0 4 and then harvested 
after 45 minutes and column I represents similar 
results as Column G except the plasmid inserted 
had no gag gene. 

as Fig 4 is an immunoblot analysis of yeast 

lysates produced through various mutations of the 
gag gene. The immunobiots were developed either 
with rabbit antibodies raised against disrupted 
HTLV-HI (Part A of Fig, 4) or with AIDS patient 

40 serum (Part B of Rg. 4). In Parts A and B, column I 
is the result of lysates from cells induced with 
pYE72/gag I, while column 2 is the results of the 
lysates from cells induced with pYE72/gag 2 and 
column 3 is the results of the lysate from cells 

45 induced with pYE72/gag 3. 

Rg 5 is the DNA sequence of that portion 
the X HXB-3 gene showing the gag/pol overlap. 
The carboxy-terminal coding region of gag and the 
amino-terminal coding region of gol are shown. The 

so region that is homologous to other gag proteases - 
[Toh et al.. Nature 3|5, 691 (1985)] is underlined. 
The reading frame to which the pol gene is shifted 
by the Bell fill in is shown by the arrow, periods i.e. 
\" in this figure indicate a translation termination 

55 site. 

Rg. 6-A is an immunoblot analysis utilizing 
antigen produced by the lysates of yeast cells 
induced with pYE72/gagl, each of the columns re- 
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presents blood samples taken from a different 
AIDS patients living at the east coast of the United 
States. 

Fig. 6-B is the same as Fig. 6-A except that 
the patients are taken from the west coast of the 
United States. 

Fig. 7 is the DNA sequence encoding the 56 
kd gag protein precusor produced in accordance 
with this invention. 

Fig. 8 is the amino acid sequence of the 56 
kd gag protein precusor produced in accordance 
with this invention. 

Fig. 9 is the DNA sequence encoding the 
proteolytic 24 kd gag protein produced in accor- 
dance with this invention. 

Rg. 10 is the amino acid sequence of the 24 
kd proteolytic gag protein is produced in accor- 
dance with this invention. 

Rg. II is the DNA sequence encoding the 
proteolytic 16 kd gag protein produced in accor- 
dance with this invention. 

Rg. 12 is the amino add sequence of the 16 
kd proteolytic gag protein produced in accordance 
with this invention. 

Rg. 13 is the DNA sequence encoding the 14 
kd proteolytic gag protein produced in accordance 
with this invention. 

Rg. 14 is the amino acid sequence of the 14 
kd proteolytic gag protein produced in accordance 
with this invention. 

Rg. 15 is the DNA sequence encoding the 48 
kd proteolytic gag protein produced in accordance 
with this invention. 

Rg. 16 is the amino acid sequence of the 48 
kd proteolytic gag protein produced in accordance 
with this invention. 



Detailed Description of the Invention 

In the description the following terms are em- 
ployed: 

Nucleotide: A monomeric unit of DNA consist- 
ing of a sugar moiety (pentose), a phosphate, and 
either a purine or pyrimidine base (nitrogenous 
heterocyclic). The base is linked to the sugar moi- 
ety via the glycosidic carbon (I' carbon of the 
pentose). That combination of a base and a sugar 
is called a nucleotide. Each nucleotide is character- 
ized by its base. The four DNA bases are adenine - 
("A"), guanine CG"), cytosine ("C") and thymine - 
(T"). 

DNA Seauence : A linear array of nucleotides 
connected one to the other by phosphodiester 
bonds between the 3' and 5' carbons of adjacent 
pentoses. 



Codon : A DNA sequence of three nucleotides 
(a tripiet) which encodes through mRNA an amino 
add, a translation start signal or a translation ter- 
mination signal. For example, the nucleotide triplets 

5 TTA, TTG, CTT, CTC, CTA and CTG encode for 
the amino add leucine ("Leu"). TAG, TAA and 
TGA are translation stop signals and ATG is a 
translation start signal. 

Reading Frame: The grouping of codons during 

io translation of mRNA into amino add sequences. 
During translation the proper reading frame must 
be maintained. For example, the sequence 
GCTGGTTGTAAG may be translated in three read- 
ing frames or phases, each of which affords a 

75 different amino add sequence: 

GCT GGT TGT AAG-Ala-Gly-Cys-Lys 
G CTG GTT GTA AG-Leu-Val-Val 
GC TGG TTG TAA G-Trp-Leu-(STOP) 

Polypeptide : A linear array of amino adds 

20 connected one to the other by peptide bonds be- 
tween the a-amino and carboxy groups of adjacent 
amino adds. 

Genome : The entire DNA of a cell or a virus. It 
includes inter alia the structural genes coding for 

25 the polypeptides of the substance, as well as oper- 
ator, promoter and ribosome binding and inter- 
action sequences, including sequences such as the 
Shine-Dalgamo sequences. 

Structural Gene : A DNA sequence which en- 

30 codes through its template or messenger RNA 
("mRNA") a sequence of amino adds characteris- 
tic of a specific polypeptide. 

Transcription: The process of produdng mRNA 
from a structural gene. 

35 Translation : The process of producing a poly- 
peptide from mRNA. 

Expression: The process undergone by a struc- 
tural gene to produce a polypeptide, it is a com- 
bination of transdption and translation. 

40 Plasmid : A drcular double-stranded DNA Mol- 

ecule that is not a part of the main chromosome of 
an organism containing genes that convey resis- 
tance to spedfic antibiotics. -When the plasmid is 
placed within aunicellular organism, the characteris- 

45 tics of that organism may be changed or trans- 
formed as a result for the DNA of the plasmid. For 
example, a plasmid carrying the gene for 
tetracycline resistance (Tet*) transforms a cell pre- 
viously sensitive to tetracycline into one which is 

so resistant to it A cell transformed by a plasmid is 
called a "transformant." 

Cloning Vehicle : A plasmid, phage DNA or 
other DNA sequences which are able to replicate in 
a host cell, which are characterized by one or a 

55 small number of endonuclease recognition sites at 
which such DNA sequences may be cut in a deter- 
minable fashion without attendant loss of an essen- 
tial biological function of the DNA, e^, replication, 
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production of coat proteins or loss of promoter or 
binding sites, and which contain a marker suitable 
for use in the identification of transformed cells, 
e.g.. tetracycline resistance or ampicillin resistance. 
A cloning vehicle is often called a vector. 

Cloning: The process of obtaining a population 
of organisms or DNA sequences derived from one 
such organism or sequence by asexual reproduc- 
tion. 

Recombinant DNA Molecule or Hybrid DNA : A 
molecule consisting of segments of DNA from dif- 
ferent genomes which have been joined end-to-end 
outside of living cells and have 4he capacity to 
infect some host cell and be maintained therein. 

The nomenclature used to define the peptides 
or proteins is that used in accordance with conven- 
tional representation such that the amino group at 
the N-terminus appears to the left and the carboxyl 
group at the C-terminus to the right By natural 
amino acid is meant one of common, naturally 
occurring amino acids found in proteins comprising 
Gly, Ala, Val, Leu, lie, Ser, Thr, Lys, Arg, Asp, Asn, 
Glu, Gin, Cys, Met, Phe, Tyr, Pro, Trp and His. 
Where the amino acid residue has isomeric forms, 
it is the L-form of the amino acid that is repre- 
sented unless otherwise expressly indicated. In ad- 
dition, amino acids have been designated by spe- 
cific letters of the alphabet such that: A = Alanine; 
D=Aspartic Acid; N = Asparagine; C = Cysteine; 
D = Aspartic ■ Acid; E = Glutamic Acid; 
F= Phenylalanine; G= Glycine; H = Histidine; 
I = Isoleucine; K = Lysine; L= Leucine; 

M = Methionine; N = Asparagine; P = Proline; 
Q = Glutamine; R = Arginine; S = Serine; 
T= threonine; V = Valine; W = Tryptophan; 
Y = Tyrosine; Q = Glutamine; E = Glutamic Acid. 

In accordance with the present invention, the 
search for the protein of the etiofogic agent for 
acquired immune deficiency syndrome (AIDS) has 
led to the isolation and sequencing of the proviral 
gene of the AIDS virus. It has now been discov- 
ered, for what is believed to be the first time that 
the postulated etiologic agents of AIDS, 
lymphadenopathy-associated virus (LAV), AIDS-As- 
sociated retrovirus (ARV) and human T-cell 
leukemia/lymphoma/lymphotropic virus (HTLV ill) 
are in fact variants of the same virus. For purposes 
of this invention and claims the virus causing AIDS 
will be referred to herein as HTLV-III virus. HTLV-III 
virus will be understood to include the variants 
which have been postulated as the causative agent 
of AIDS, namely LAV and ARV . 



As seen in Rg. I, the genome for HTLV-III is 
known, i.e. XHXB-3 and contains regions which 
code for the gag-protein, pol protein, sor-protein 
and envelope(env)-protein. The region of the 

5 genome which codes for the gag-protein is found 
within the 5.5 kb EcoRI fragment region and more 
particular within the Cla I through Bel I region. 

In accordance with this invention, in order to 
obtain the proteins of this invention, the HTLV-III 

10 gene is cut with one or more restriction enzymes to 
obtain the fragment which contains the gene en- 
coding the gag-protein. This fragment is then ligat- 
ed with a promoter to form a gene containing the 
promoter operably linked to a DNA sequence cod- 

75 ing for the gag-protein. It is through this linking that 
modification at the amino terminus of the protein 
produced therefrom are introduced when ex- 
pressed in an organism. In the next step the gene 
containing the promoter and DNA sequence encod- 

20 ing the gag-protein of HTLV-III is then inserted into 
a plasmid or expression vector repiicable in a suit- 
able microbiological host to form a plasmid or 
expression vector containing the promoter operably 
linked to the DNA sequence encoding the gag- 

25 protein for HTLV-III. 

In the current state of the art, there are a 
number of promoter systems and suitable microbial 
hosts available which are appropriate to the present 
invention. Also, there are many types of plasmids 

so into which the gene encoding the gag-protein of 
HTLV-III can be inserted. In general, plasmid ex- 
pression vectors containing replication and con- 
trolled sequence, which are derived from species 
compatable with the host cell are used in connec- 

35 tion with these hosts. For example, E. coli is typi- 
cally transformed using plasmid pBR322, a plasmid 
derived from an E. coli species. For use with yeast, 
such as S. cerevisiae a plasmid such as pYE7 is 
generally utilized. 

40 In accordance with this invention, any conven- 
tional promoter compatible with the host and the 
plasmid selected can be utilized. Promoters used 
for recombinant DNA construction in E. coli include 
the beta-lactamase (penicillinase) and lactose pro- 

45 moter such as disclosed by Chang et al., Nature 
275: 6I5 0978); Itakura et al., Science, I98; I056 - 
(I977); promoter systems such as disclosed by 
Andersen et al., Mol. Cel. Bol. 3, 562-569 (I983) 
and Tryptophan promoter systems such as dis- 

50 closed by Goeddel et al., Nucleic acids Res. 8, 
4057 (I980) also EPO Appl. No. 0036776. In yeast, 
promoters including but being not limited to those 
from ADCI, GAU, GALI0. PH05. PGKl and GAPI 
have been used as reviewed by Broach et al. - 

55 (I983) in Experimental Manipulation of Gene Ex- 
pression, M. Inouyd, ed„ Academic press: New 
York, N.Y., pp 83-II7. While these are the most 
commonly used, other microbial promoters have 
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been discovered and utilized and details concern* 
ing their nucleotide sequences have been pub- 
lished, enabling a skilled worker to ligate them 
functionally in an operable relationship to the genes 
in transformation vectors [Sibenlist.et al. t Cell 20, 
269 (I980)]. 

A wide variety of host/cloning vehicle combina- 
tions may be employed in cloning the double- 
stranded DNA. For example, useful cloning ve- 
hicles may consist of segments of chromosomal, 
nonchromosomal and synthetic DNA sequences, 
such as various known bacterial plasmids, e.g., 
plasmids from E. coii such as pBR322, phage 
DNA, and vectors derived from combinations of 
plasmids and phage DMAs such as piasmids which 
have been modified to employ phage DNA or other 
expression control sequences or yeast plasmids. 
Useful hosts may include microorganisms, mam- 
malian cells, plant cells and the like. Among them 
microorganisms and mammalian cells are prefer- 
ably employed- As preferable microorganisms, 
there may be mentioned yeast such as S. 
cerevisiae and bacteria such as Escherichia coii, 
Bacillus subtillis, Bacillus stearothermophilus and 
Actinomyces. The above-mentioned vectors and 
hosts may also be employed for the production of 
a protein from a gene obtained biologically as in 
the instant invention. Of course, not all host/vector 
combinations may be equally efficient The particu- 
lar selection of hostfdoning vehicle combination 
may be made by those skilled in the art after due 
consideration of the principles set forth without 
departing from the scope of this invention. 

Furthermore, within each specific cloning ve- 
hicle, various sites may be selected for insertion of 
the double-stranded DNA. These sites are usually 
designated by the restriction endonuclease which 
cuts them. For example, in pBR322, the EcoRI site 
is located just outside the gene coding for ampicil- 
lin resistance. Various sites have been employed 
by others in their recombinant synthetic schemes. 
Several sites are well recognized by those of skill 
in the art It is, of course, to be understood that a 
cloning vehicle useful in this invention need not 
have a restriction endonuclease site for insertion of 
the chosen DNA fragment Instead, the vehicle 
could be joined to the fragment by alternative 
means. 

The vector or cloning vehicle and in particular 
the site chosen therein for attachment of a selected 
DNA fragment to form a recombinant DNA mol- 
ecule is determined by a variety of factors, e.g. 
number of sites susceptible to a particular restric- 
tion enzyme, size of the protein to be expressed, 
susceptibility of the desired protein to proteolytic 
degradation by host cell enzymes, contamination of 
the protein to be expressed by host cell proteins 
difficult to remove during purification, expression 



characteristics, such as the location of start and 
stop codons relative to the vector sequences, and 
other factors recognized by those of skill in the art 
The choice of a vector and an insertion site for a 

5 particular gene is determined by a balance of these 
factors, not all selections being equally effective for 
a given case. 

There are several known methods of inserting 
DNA sequences into cloning vehicles to form re- 

w combinant DNA molecules which are equally useful 
in this invention. These include, for example, direct 
ligation, synthetic linkers, exonuclease and 
polymerase-Hnked repair reactions followed by liga- 
tion, or extension of the DNA strand with DNA 

is polymerase and an appropriate single stranded 
template followed by ligation. 

It should, of course, be understood that the 
nucleotide sequences of the DNA fragment insert- 
ed at the selected site of the cloning vehicle may 

20 include nucleotides which are not part of the actual 
structural gene for the desired polypeptide/protein 
or may include only a fragment of the complete 
structural gene for the desired protein. It is only 
required that whatever DNA sequence is inserted, a 

25 transformed host will produce a protein/peptide 
having an immunological activity to the AIDS gag- 
protein or that the DNA sequence itself is of use as 
a hybridization probe to select clones which con- 
tain DNA sequences useful in the production of 

30 polypeptides/proteins having an immunological ac- 
tivity to the AIDS gag-protein. 

The cloning vehicle or vector containing the 
foreign gene is employed to transform a host so as 
to permit that host to express the protein or portion 

35 thereof for which the hybrid DNA codes. The selec- 
tion of an appropriate host is also controlled by a 
number of factors recognized by the art These 
include, for example, compatibility with the chosen 
vector, toxicity of proteins encoded by the hybrid 

40 plasmid, ease of recovery of the desired protein, 
expression characteristics, biosafety and costs. A 
balance of these factors must be struck with the 
understanding that not all hosts may be equally 
effective for expression of a particular recombinant 

45 DNA molecule. 

Once the organism capable of carrying out the 
expression of the gag gene has been created, the 
process of this invention can be carried out in a 
variety of ways depending upon the nature of the 

so construction of the expression vectors for the gag 
gene and upon the growth characteristics of the 
host Typically, the host organism will be grown 
under conditions which are favorable to production 
of a large quantities of cells. When a large number 

55 of cells has accumulated suitable inducers or de- 
repressors in the growth medium cause the pro- 
moter supplied with such gene sequence to be- 
come active permitting the transcription and trans- 
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lation of the coding sequence. The protein pro- 
duced by the recombinant cell can be iysated by 
conventional means well known in the art It is 
apparent that the particular means of lysating will 
depend upon the host eel! utilized. 

In a preferred embodiment of this invention, the 
HTLV-III gag gene is introduced into a yeast ex- 
pression vector with a promoter which is not ac- 
tivated in a phosphate medium. This promoter was 
obtained from PH05 as described by Thill et at., 
Mol. Ceii Biol. 3, 570-579 (I963). The gag gene was 
obtained from a 5.5Kb Eco Rl fragment from the 
HTLV-III clone X HXB-3 [Shaw et al. Science 226, 
II65-II7I (I984)]. This is the Eco RI fragment of the 
HTLV-III genome illustrated in Rg. I-A. 

Fig. I-B illustrates the formation of the hybrid 
gene containing the PH05 promoter and the gag 
gene fragment in Rg. I-A. The yeast promoter and 
translation initiation site are on a 560bp BamHI to. 
Ahalll restriction fragment derived from the gene 
for repressive acid phosphatase, PH05. The en-* 
zyme Aha lll produces a blunt end just after the 
second codon of PH05 as seen in Rg. I-B. The 
gag gene within the 5.5 kd EcoRl fragment is 
ligated to the promoter obtained from the BamHI to 
Ahalll restriction fragment derived from PH05. The 
EcoRl fragment from the HTLV-III clone contains 
the entire gag gene and a large part of the pol 
gene which overlaps the gag gene in a different 
reading frame (Ratner et al., supra; Sanchez- 
Pescador et al., supra; Wain-Hobson et al., supra; 
and Muesing et al. t supra). The EcoRl fragment 
was cut with Clal near the amino terminus of the 
gag gene and the resulting 5' overlap was filled in 
with DNA polymerase large fragment to create a 
blunt-ended DNA molecule beginning with an ar- 
ginine codon. Ligation of this end to the Ahalll of 
the PH05 fragment fused the promoter and the 
first two codons of PH05 to the fifteenth codon of 
the gag gene. 

The fused gene prepared above was then in- 
serted into a pYE7 vector, a vector that can both 
replicate in yeast and E. coli. This recombinant 
plasmid was labelled pYE72/gagl. The resulting 
plasmid was then used to transform yeast. 

In carrying out this ligation, any conventional 
method of ligation can be utilized to fuse the pro- 
moter to the gag gene. Any conventional method of 
transforming a microorganism such as yeast with a 
plasmid can be utilized to produce the recombinant 
organism which will express the gag gene. 

The PH05 promoter in the plasmid labelled 
pYE72/gagl was induced in the yeast cells by 
growth in a phosphate-free medium and extracts of 
the ceils were analyzed for the present of the gag- 
specific proteins by immunoblot analysis using rab- 
bit polyclonal antiserum to disrupted virus as 
shown in Rg. 2. The column labelled I, in Rg. 2, 



represents the results from the cells with 
pYE72/gagl grown in a high phosphate containing 
medium whereas column 2 represents the results 
from cells with pYE72/gagl grown in a phosphate- 

5 free medium. The indicated molecular weights are 
used to determine the sizes of the reactive band. 
As seen from Rg. 2, there was no expression of 
the gag gene in the phosphate containing medium 
were the promoter was not activated to produce the 

10 gag gene. On the other hand, when the same cells 
were grown in a phosphate-free medium, the 
lysates produced immuno reactive proteins which 
correspond to the gag protein and its various 
known proteolytic proteins which are formed there- 

75 from. 

As seen from column 2 in Rg. 2, a major 
immunoreactive protein produced corresponded in 
size to the HTLV-III p24 gag protein identified in 
virions and predicted from the known DNA se- 

20 quence. The reactive species of the sizes expected 
for the pl4 and pi6 proteins were detected as well. 
A larger protein of about 56 kd size which cor- 
responded to the entire gag protein as well as 
several species of about 40 kd which represents a 

25 proteolytic processing intermediate were also pro- 
duced. 

To determine whether the recombinant HTLV- 
III gag protein is actually processed by proteolysis 
in yeast to produce the actual viral specific prot- 

30 eolytic proteins, the proteins obtained from the 
lysate of the yeast transformed with pYE72/gagl 
was followed in a pulse chase experiment with S*- 
methione or °PO*. Rg. 3 shows the results of this 
experiment each of the columns represent a dif- 

as ferent chase time as follows: A) 0 min, B) 2 min, 
C) 5 min, D) 10 min, E) 20 min, F) 30 min, G) 45 
min. in all respects the complete gag gene of 56 
kd was detected with radioactivity first. The protein 
of about 25 kd seen in Rg. 2 was also detected 

40 before the 24 Kd protein and thus may be its 
immediate precursor. Protein migrating as expect- 
ed of I6 Kd was also detected. The p!4 region of 
the gag gene has no methionine. Therefore, the 
failure to detect this protein supports its proposed 

45 origin. 

Phosphorylation of the recombinant gag pro- 
teins made in yeast was examined after growth of 
the yeast cells transformed with pYE72/gagl in a 
low phosphate medium containing "PO*. In Rg. 3 

so columns H and I show the results of gel analysis of 
labelled proteins immunopercipitated with rabbit 
HTLV-III antibodies. In column H where the trans- 
formed yeast was utilized, the 56 kd, one of the 40 
kd intermediates, the 25 kd, the 24 kd and the 16 

55 kd species were all radioactive. No radioactivity 
was seen at the position of the 14 kd band when a 
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higher percentage of gel was run. Thus the p24 
and pl6 gag protein appear to be phosphorylated in 
yeast No gag proteins were seen when the non- 
transformed yeast was utilized as in column I. 

As seen from Rg. I-A presence of a Bglll 
restriction site near the carboxy-terminaJ coding 
region of the gag gene provides a means to elimi- 
nate about half of the p!6 coding region as well as 
most of the pol gene present in 5.5 kd EcoRI 
fragment The carboxy-terminal part of the gag 
polyprotein and/or the amino terminal region of the 
pol gene has been shown to encode a processing 
protease in many retroviruses (see, for example, 
Dickson et al. r supra). 

In order to remove the protease from the re- 
combinant gag protein the Bglll to EcoRI portion of 
the EcoRI fragment of HTLV-III genome was re- 
moved (see Rg. I-B). It is believed that the re- 
moved portion of the gag gene encodes proteases 
which convert the large recombinant gag protein 
into te various mature protein species. Therefore, 
the BamHI-EcoRI gene prepared in Rg. I-B can be 
fragmented by treating either the gene or the plas- 
mid with Bglll so as to remove the Bglll to EcoRI 
fragment from this gene. If the expression plasmid 
pYE72/gagl is used, this plasmid is opened by 
digesting with BamHI and Bglll. Once the BamHI to 
Bglll portion of this EcoRI fragment is obtained, this 
portion may be inserted into a suitable plasmid, 
depending upon the microorganism into which it is 
to be grown. In inserting this portion of the gene 
into a plasmid it may be necessary to ligate this 
portion to series of DNA bases which permit inser- 
tion into the desired plasmid. In the case where 
pYE7 is utilized, the BamHI to Bglll fragment is 
ligated with the 375 bp BamHI-EcoRI fragment 
from pBR322 to form a BamHI to EcoRI fragment 
which can be inserted into pYE7 to form 
pYE72/gag2. 

In accordance with another embodiment of this 
invention another mutation of the gag gene was 
introduced at the Bell restriction site in the pol 
gene just downstream of the gag, i.e. pYE72/gag3. 
This mutation involves introducing four base pairs 
at the Bell site which cause a frame shift out of the 
pol reading frame as shown in Rg. 5. In Rg. 5 the 
carboxy terminal coding region of the gag gene 
and the amino terminal coding region of the pol 
gene are shown. The region that is homologous to 
other gag proteases is underlined in this figure. 
The reading frame created by the Bell fill in is 
shown by the arrow. A period indicates a tran- 
slations terminal site. The frame shift caused by 
the insertion of the bases deactivated the gene 
from producing certain proteases which may prot- 
eolytically cleave the gag protein. 



The products formed by the full recombinant 
gag gene and the truncated gag genes are shown 
in figure 4. Column I in Rg. 4 represent in> 
munoblots taken from lysates of cells induced with 

5 pYE72/gagl. Column 2 in Rg. 4 is directed to the 
result of immunoblots taken from lysates from cells 
induced with pYE72/gag2; and column III repre- 
sents immunoblots taken from lysates from cells 
induced with pYE72/gag3. As seen, pYE72/gagl 

to and 3 produced the large protein p56. For 
pYE72/gag3, the major immunoreactive protein 
band was about 56 kd and comigrated with the 
presumptive precursor protein seen in the cells 
with induced pYE72/gagl. However, the mutation 

75 also appears to prevent processing of this recom- 
binant gag precursor. As with the deletion mutant, 
a smaller amount of p24 appeared to be present as 
compared with protein seen in the ceils induced 
with pYE72/gagl. In addition to a major 56 kd 

20 product and the smaller species, a band at about 
60 kd was detected in the frame shift mutant This 
size would be consistent with gag/pol fusion prod- 
uct that was terminated as shown in Rg. 5 as a 
result of the Bell filling in. 

25 

Screening of AIDS SERA 

Because anti-HTLV-III antibodies are found in 

30 more than 90% of the AIDS patients, the micro- 
bially synthesized gag gene products can be used 
as diagnostic tools for the detection of these anti- 
bodies. For this analysis as seen in Rgures 6-A 
and 6-B, total cell protein from the yeast culture 

35 induced with pYE72/gagl was fractionated by SDS- 
PAGE and transferred to a nitrocellulose filter by 
Western blotting technique. Strips of the filter con- 
taining transferred proteins were reacted with I000- 
fold diluted human sera, and the antigen-antibody 

40 ~ complexes formed were detected by incubation of 
the strips with I25-I-Iabeiled Staphylococcus aureus 
protein A followed by autoradiography. Prominent 
bands corresponding to reaction of the antibody to 
the 56 kd, 24 kd, I6 kd and !4 kd proteins were 

45 consistently observed when the serum used was 
from patients with AIDS syndrome. The results of 
one such assay with II human sera from patients on 
the East cost are presented in Rgure 6-A. Similar 
results with II serum samples from west cost pa- 

50 tients appear in Rg. 6-B. The negative controls (not 
shown) used were normal human sera. No reaction 
observed with sera from healthy individuals. 

It appears, therefore, that the recombinant gag 
gene products can be used as diagnostic reagents 

55 for the detection of AIDS associated antibodies. 
The recombinant gag gene products of the instant 
invention encompasses a large portion of the pro- 
tein molecule and contains both the conserved and 
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divergent portions of the molecule. In spite of the 
divergence observed between HTLV-III and ARV-2 
sequences the recombinant gag protein products of 
the instant invention synthesized by the bacteria 
react with AIDS patient sera derived from both 
geographical locations of the United States. One 
hundred percent (100%) of AIDS patient sera (22 
Individual samples, II derived from the East Coast 
of the United States and II derived from California) 
tested showed high reactivity. This is strong evi- 
dence for the presence of conserved epitopes with- 
in the molecule against which the immune system 
could mount an antibody reaction. The human im- 
mune system may thus be mounting an immune 
response against conserved epitopes of the gag 
proteins molecule by the reactivity of the AIDS 
patient sera. 

Based on these discoveries it is proposed that 
in the practice of screening blood for Acquired 
immune Deficiency Syndrome (AIDS), the AIDS 
recombinant gag protein products of this invention 
can be utilized. Utilizing the protein products of the 
instant invention, human blood can be screened for 
the presence of antibodies to the AIDS virus. This 
and other techniques are readily determined. The 
foregoing and other objects, features and advan- 
tages of the invention will be apparent from the 
following examples of preferred embodiments of 
the invention. 

In the Examples, E cofi strain MCI06I was the 
same as described by Casadaban et a!., J. Mol. 
Biol. {38, 179-207 (1980). The E. cgli strain GMII9 
used for the preparation of the unmethylated DNA, 
was that described by Arraj et al„ J. Bact. 153, 562- 
563 (1983). E. coli strains MC 1061 and GM 119 were 
deposited at American Type Culture Collection - 
(ATCC) on November 26, 1985 the accession nos. 
being ATCC 53338 and ATCC 53339, respectively. 
The yeast expression plasmid pYE7 used in Exam- 
pie 5 was the same as described in Examples and 
Fig. 6 of European Patent Application publication 
No. 0124824 which is incorporated by reference. 
The yeast used was S. cerevisiae 20B-I2 (ATCC 
No. 20626). Yeast transformation was performed as 
described by Hinnen et a!., Proc. Nat. Acad. ScL 
USA, 75, 1929-1934 (1978). The PH05 gene utilized 
to produce the promoter was obtained from the 
plasmid pAP20 as described by Andersen et at., 
Mol. Cell Biol. 3 562-569 (1983). The X HXB-3 used, 
was as described by Shaw et a!., Science 226, 1165- 
If7l (1984). 



Example I 



Preparation of pYE72/oaal 

Restriction and DNA modifying enzymes were 
used as recommended by the supplier. All restric- 

5 tion enzyme digests were performed at 37°C for I 
hr with 0.5-1.0 units of enzyme in 50mM NaCI, 
lOmM Tris-HCI, pH 7.4, lOmM MgCI* ImM dithioth- 
reitol (DTT). The 560 bp BamHI to Ahalll fragment 
with PH05 promoter and translation initiation region 

to was obtained from the plasmid pAP20. and the Clal 
to EcoRI fragment with the gag/pol region was 
obtained from X HXB-3. The 5.5 kb. Ecol fragment 
was subcloned into pBR3222 and the plasmid 
grown in E. coli GMII9. The Clal 5' overhang of the 

75 Clal to EcoRI-fragment was "filled in" by treatment 
with 5 units of the Klenow fragment of E. coli DNA 
polymerase I in the presence of all four deox- 
yribonucleotides at 50U.M (C.TA and G) at I6°C 
for 2 hr. in the same buffer as that used for 

20 restriction enzyme reactions. 

Approximately equal amounts of the BamHI- 
Ahalll PH05 fragment and the Clal (filled-in)-EcoRI 
gag/pol fragment were treated with 1.0 units of T4 
DNA ligase for 16 hr at I6 P C in 50mM Tris-HCI pH 

25 7.8, lOmM MgCI,, 20mM DTT, ImM «ATP. The 
products were cut with BamHI and EcoRI and the 
PH05-gag/pol fusion was inserted by ligation into 
pYE7 which had been cut with BamHI and EcoRI. 
The resulting expression plasmid was designated 

30 pYE72/gagl. 



Example 2 

35 Preparation of pYE72/aao2 

A deletion that removed the carboxy-terminal 
portion of gag and all of pol was made by digesting 
pYE72/gagl with BamHI and Bgill and isolating the 

40 PH05-gag fragment This was converted to a 
BamHI to EcoRI fragment by ligation with the 375 
bp BamHI-EcoRI fragment from pBR322 to the 
Bgill site through the identical 5' overhangs of Bgill 
and BamHI. Insertion of this fragment into pYE7 

45 yielded PYE72/gag2. 



Example 3 

so Preparation of oYE72/oaQ3 

Another mutation was introduced at the Bell 
site in the pol gene just downstream of gag. Since 
Bell does not cut methylated DNA, pYE72/gagl was 
55 introduced into E. coli GMI19, and DNA was pre- 
pared and cut with Bell. The 5' overhang was filled 
in with Klenow fragment as above and the plasmid 
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was recircularized by blunt-end ligation. The result- 
ing plasmid with the 4 bp GATC insert at the Bell 
site was designated pYE72/gag3. 



Example 4 

Growth and induction of Yeast Strains 

Separate cultures of the yeast strain S. io 
cerevisiae 20B-12 containing the gag expression 
plasmids or a vector with no inserted gene were 
grown in the medium YCAO and induced in 
phosphate-free medium as described by Kramer et 
aU Proc. Natl. Acad. Sci USA 81, 367-370 (1984). 75 
About 6 hr after transfer to phosphate-free medium, 
cells were collected by centrifugation and 
spheroplasts prepared with Zymo lyase 60,000 - 
[Kaneko, T., Kitamura, K., and Yamamoto, Y. f Agr. 
Biol. Chem., 37, 2295 (1973)] and collected. 20 
spheroplasts were usually stored at -20°C before 
analysis. For protein purification, whole ceils in- 
stead of spheroplasts are collected and frozen. 



Example 5 

Purification of the recombinant gag Protein Pro- 
ducts 

30 

A homogeneous recombinant gag protein or its 
proteolytic products can be purified according to 
the following procedure. The induced yeast cells 
are broken by standard mechanical procedures. 
These include passage through a French Pressure 35 
Cell or rapid mixing with glass beads In a mixer 
such as a Bead Beater, a Dyna-Mili or a Braun 
Homogenizer. For any of the above, the cell paste 
is resuspended in approximately 2 volumes - 
(relative to cell pellet) of 50mM NaP0 4 , pH7.4 buff- 40 
er. For glass bead lysis, an equal volume of 0.5 
mm glass beads Is added. Lysis is accomplished 
by either three passages at 20,000 p.sJ. through 
the French Pressure Cell or as recommended by 
the manufacturer of the mixer. Lysis can be mon- 45 
itored by microscopy. 

Following cell breakage, ceil debris (and glass 
beads) are removed by two 10 min centrifugation at 
600 *g. Another centrifugation at 12,000 K g for 20 
min removes mitochondria. The proteins are then so 
fractionated by centrifugation at 100,000 *g for I hr 
to obtain a pellet (microsomal fraction) and super- 
natant (soluble fraction). If the gag proteins are in 
the microsomal fraction, solubilization using either 
various ionic and/or non-ionic detergents or de- 55 
naturing reagents such as urea or guanidine HCI is 
necessary prior to further purification. 



Additional purification can be achieved by stan- 
dard liquid chromatography procedures. Different 
chromatography media can be used to obtain frac- 
tionation by gel filtration, ion exchange chromatog- 
raphy, and/or affinity chromatography. For gag pro- 
teins, single-stranded ONA cellulose affinity 
chromatography should be useful since gag pro- 
teins bind to nucleic acids. 

Final purification can be obtained by reverse 
phase high performance liquid chromatography - 
(HPLC). The HPLC step yields the precursor gag 
protein and the natural proteolysis proteins derived 
therefrom in a substantially 100% pure form. It is 
also foreseeable that monoclonal antibody affinity 
chromatography columns utilizing gag polyclonal or 
monoclonal antibodies to the precursor gag protein 
and the natural proteolysis proteins, could be used 
as an alternative to HPLC. 

By the above purification procedure, one ob- 
tains the following products: 

The 56 kd protein having the structure given in 
Fig 8; 

The 24 kd protein having the structure given in 
Fig 10; 

The 16 kd protein having the structure given in 
Fig 12; 

The 14 kd protein having the structure given in 
Fig 14; 
and 

The 48 kd protein having the structure given in 
Rg 16. 



Example 6 

Polvacrvlamide Gel Electrophoresis and Western 
Blot Analysis 

For the immunoblot analysis of Figs. 2, 3, 4, 6- 
A and 6-B, cells were iysed by resuspending the 
spheroplast pellets (approximately 10" cells) in an 
equal volume of 2 * sample buffer of Laemmli - 
(Laemmli, "Cleavage of Structural Proteins During 
the Assembly of the Head of Bacteriophage T4", 
Nature 227, 68-685, 1970) and incubated at 95 °C 
for five (5) minutes. Debris were pelleted by cen- 
trifugation and the cleared lysates were subjected 
to SDS-PAGE analysis, Id. For Western blot analy- 
sis, the proteins from the aery lam ide gel were 
eiectroblotted onto a 0.1 micrometers nitrocellulose 
membrane (Schleicher and Schuell) for 16 hr at 
50V, in 12.5 mM Tris, 96 mM glycine, 20% metha- 
nol, 0.01% SDS at pH 7.5. Processing of the blot 
was carried out using the methods described by 
Towbin et a!., "Eectrophoretic Transfer of Proteins 
From Polyacrylamide Gels to Nitrocellulose Sheet 
Procedure and Some applications," Proc. Natl. 
Acad. Sci. U.S.A.. 76, 4350-4354, (1979). For treat- 
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ment with the human sera, the blots were incu- 
bated with a 1000 fold dilution of the sera in anti- 
body buffer (20 mM sodium phosphate buffer, pH 
7.5 containing 0.5 M NaCI, l% BSA and 0.05% 
Tween 20) for 2-6 hr. The blots were then washed 5 
twice with phosphate buffered saline containing 
0.05% Tween 20 and then incubated with I25-I- 
labelled Staphylococcus aureus protein A for an 
additional period of I hr. The blot was then washed 
twice in PBS-Tween 20 buffer, dried and auto- to 
radiographed. 



Example 7 

75 

Labelling gf the Cells 

The cells were labelled with either ^S- 
methionine pulse-chase or "PO* for the pulse 
chase immunopre cipitation of Fig. 3 by the follow- 20 
ing procedure. For the "S-methionine pulse-chase, 
500 mCi ^S-methionine was added to the culture 
medium. After a 2 min labelling period, uniabeiled 
methionine was added. At various times,the cells 
were harvested, lysed and processed by im- 25 
munoprecipitation with rabbit antibody, and the 
lysates were analyzed by SDS-PAGE and auto- 
radiography. The chase times are as follows: A) 0 
min, B) 2 min, C) 5 min, D) 10 min. E) 20 min, F) 30 
min, G) 45 min. For the *PO* labeling in Columns 30 
H and I, I mCi "PO* was added to the culture 
medium. Labeling was continued for 30 min. at 
30 °C, then cells were harvested, and lysates pro- 
cessed for immunoprecipitation as described 
above. Lysates were from cells containing 35 
pYE72/gagl (H), and cells containing a similar plas- 
mid with no gag gene (I). 



Example 8 40 

Diagnostic Test for AIDS 

It is clear that the recombinant precursor gag 
protein and the natural proteolysis proteins derived as 
therefrom, of the instant invention may be used as 
diagnostic reagents for the detection of AIDS-Asso- 
ciated antibodies, it is also apparent to one of 
ordinary skill that a diagnostic assay for AIDS using 
polyclonal or monoclonal antibodies to the AIDS so 
recombinant gag precursor protein or the prot- 
eolytic products may be used to detect the pres- 
ence of the AIDS virus in human blood. In one 
embodiment a competition immunoassay is used 
where the antigenic substance, in this case the 55 
AIDS virus, in a blood sample competes with a 
known quantity of labelled antigen, in this case 
labelled AIDS recombinant precursor gag protein. 



or the proteolysis proteins derived therefrom for a 
limited quantity of antibody binding sites. Thus, the 
amount of labelled antigen bound to the antibody is 
inversely proportional to the amount of antigen in 
the sample. In another embodiment, an im- 
munometric assay may be used wherein a labelled 
AIDS gag antibody which complexes with the 
antigen-bound antibody is directly proportional to 
the amount of antigen (AIDS virus) in the blood 
sample. In a simple yes/no assay to determine 
whether the AIDS virus is present in blood, the 
solid support is tested to detect the presence of 
labelled antibody. In another embodiment, mon- 
oclonal antibodies to recombinant precursor AIDS 
gag protein, or the natural proteolysis proteins de- 
rived therefrom, may be used in an immunometric 
assay. Such monoclonal antibodies may be ob- 
tained by methods well known in the art, particu- 
larly the process of Miistein and Kohler reported in 
Nature 256, 495-497 (I975). The antigens in this 
assay can be the recombinant gag precursor pro- 
tein or the proteolysis proteins derived therein ei- 
ther in pure form or as a mixture of these proteins. 

The immunometric assay method is as follows: 
Duplicate samples are run in which 100 ul of a 
suspension of antibody immobilized on agarose 
particles is mixed with 100 ul of serum and 100 ul 
of soluble ^l-labelled antibody. This mixture is for 
specified times ranging from one-quarter hour to 
twenty-four hours. Following the incubation periods 
the agarose particles are washed by addition of 
buffer and then centrifuged. After removal of the 
washing liquid by aspiration, the resulting pellet of 
agarose particles is then counted for bound ia5 l- 
labeiled antibody. The counts obtained for each of 
the complexes can then be compared to controls. 

Various features of the invention are set forth in 
the following claims. 



Claims 

1. A polypeptide immunologically equivalent to 
the gag-protein products of HTLV-III having the 
amino acid sequence given in either Fig. 8 or Fig. 
16, or I4kd, I6kd or 24kd proteolytic polypeptide 
fragments thereof or polypeptides related to any of 
said polypeptides by amino acid substitution(s) 
which occur through mutations of a recombinant 
host cell which produces said polypeptides. 

2. The polypeptide of claim I wherein said 
polypeptide is the 56kd precursor having the amino 
acid sequence: 

MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAV- 
NPGLLETSEGCRQILG QLQPSLQTGSEELRSLYN- 
TVATLVCVHQRIEIKDTKEALDKIEEEQNKSK 
KKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQA- 
ISPRTLNAWVKWEEK AFSPEVIPMFSALSEGATP- 



12 



0230 222 



24 



QDLrJTMLNTVGGHQAAMQMLXETINEEAAEW 

DRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQ!- 

GWMTNNPPIPVGEIY KRWIILGLNKIVRMYSPTSIL- 

DIRQGPKEPFRDYVDRFYKTLRAEQASQE 

VKIWMTETLLVQNANPDCKTILKALGPAATLEEN- 

MTACQGVGGPGHKARV LAEAMSQVTNTATIMM- 

QRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKG 

CWKCGKEGHQMKDCTERQANFLGKIWPSYKGRP- 

GNFLQSRPEPTAPPFLQ SRPEPTAPPEESLRSG- 

VETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 

3. The polypeptide of claim I wherein the poly- 
peptide is a I4kd proteolytic fragment having the 
amino acid sequence: 

MFRWEK1RLRPGGKKKYKLXHIVWASRELERFAV- 
NPGLLETSEGCRQILG QLQPSLQTGSEELRSLYN- 
TVATLYCVHQR1EIKDTKEALDK1EEEQNKSK 
KKAQQAAADTGHSSQVSQNY 

4. The polypeptide of claim I wherein said 
polypeptide is the 48kd proteolytic fragment having 
the amino acid sequence: 

MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAV- 

NPGLLETSEGCRQILG QLQPSLQTGSEELRSLYN- 

TVATLYCVHQRIEIKDTKEALDKIEEEQNKSK 

KKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQA- 

ISPRTLNAWVKWEEK AFSPEVIPMFSALSEGATP- 

QDLNTMLNTVGGHQAAMQMLKETINEEAAEVV 

DRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQl- 

GWMTNNPPIPVGE1Y KRWHLGLNKIVRMYSPTSIL- 

DIRQGPKEPFRDYVDRFYKTLRAEQASQE 

VKNWMTETLLVQNANPDCKTILKALGPAATLEEM- 

MTACQGVGGPGHKARV LAEAMSQVTNTAT1MM- 

QRG N FRN QRKIVKCFNCGKEG HIARNCRAP RKKG 

CWKCGKEGHQMKDCTERQANFLGKIHRTGWAM- 

(A 

5. The polypeptide of claim I wherein said 
polypeptide is the I6kd proteolytic fragment having 
the amino acid sequence: 

MQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKK- 
GCWKCGKEGHQMKDCT ERQANFLGKIWPSYKG- 
RPGNFLQSRPEPTAPPFLQSRPEPTAPPEESLRS 
GVETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 

6. The polypeptide of claim I wherein said 
polypeptide is the 24kd proteolytic fragment having 
the amino acid sequence: 

PIVQNIQGQMVHQAISPRTLNAVvVKVVEEKAFSPE- 
VIPMFSALSEGATPQ DLrTTMLNTVGGHQAAMQM- 
LKETINEEAAEWDRVHPVHAGPIAPGQMREPR 
GSDIAGTTSTLQEQIGWMTNNPPIPVGEJYKRWIIL- 
GLNKIVRMYSPTSI LDI RQGPKEP FRDYVD RFYKT- 
LRAEQASQEVKNWMTETLLVQNANPDCKT ILKAL- 
GPAATLEEMMTACQGVGGPGHK 

7. A polypeptide immunologically equivalent to 
the gag-protein products of HTLV-II! expressed in 
yeast 



8. A gene containing a gene portion having a 
DNA sequence encoding a polypeptide as claimed 
in any one of claims I to 6 operafaly linked to a 
promoter capable of effecting the expression of 

5 said DNA sequence. 

9. The gene of claim 8 wherein said gene 
portion is the Clal to EcoRI restriction site fragment 
of the X HXB-3 genome. 

10. The gene of claim 9 wherein the promoter is 
to the BamHI to Ahalll restriction site fragment of 

PH05. 

11. The gene of claim 8 wherein said gene 
portion is the Clal to BglH restriction site fragment 
of the Clal to EcoRI fragment of the X HXB-3 

is genome. 

12. The gene of claim II wherein the promoter is 
the BamHI to Ahalll restriction site fragment of 
PH05. 

13. The gene of claim 8 where the gene portion 
20 is the Clal to EcoRI restriction site fragment of X 

HXB-3, wherein the Clal to EcoRr fragment is filled 
in at its Bell restriction site with the base sequence 
GATC. 

14. The gene of claim 13 wherein the promoter 
25 is the BamHI to Ahalll restriction site fragment of 

PH05. 

15. A gene containing a gene portion having a 
DNA sequence encoding a polypeptide im- 
munologically equivalent to the gag-protein pro- 

30 ducts of HTLV-HI operably linked to a promoter 
capable of effecting the expression of said DNA 
sequence in yeast 

16. A recombinant expression vector capable of 
effecting the expression of a polypeptide as 

35 claimed in any one of claims I to 6 containing a 
gene portion having a DNA sequence encoding 
said polypeptide operably linked to a promoter 
capable of effecting the expression of said DNA 
sequence. 

40 17. A vector according to claim 16 wherein the 
gene portion is the Clal to EcoRI restriction frag- 
ment of the X HXB-3 genome. 

18. A vector according to claim 17 wherein the 
promoter is the BamHI to Ahalll restriction site 

45 fragment of PH05. 

19. A vector according to claim 16 wherein the 
gene portion is the Clal to Bglll restriction site 
fragment of the Cla I to EcoRI restriction site frag- 
ment of the X HXB-3 genome. 

so 20. A vector according to claim 19 wherein the 

promoter is the BamHI to Ahalll restriction site 

fragment of PH05. 

21. A vector according to claim 16 wherein the 

gene portion is the Clal to EcoRI restriction site 
55 fragment of X HXB-3, wherein the Clal to EcoRI 

fragment is filled in at the Bell restriction site with 

the base sequence GATC. 
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22. A vector according to claim 21 wherein the 
promoter is the BamHI to Ahalll restriction site 
fragment of PH05. 

23. A vector according to claim 18 which is 
pYE72/gagl. 

24. A vector according to claim 20 which is 
pYE72/gag2. 

25. A vector according to claim 22 which is 
pYE72/gag3. 

26. A recombinant expression vector capable 
of effecting the expression of a polypeptide im- 
munologically equivalent to the gag-protein pro- 
ducts of HTLV-III in yeast containing a gene portion 
having a DNA sequence encoding the gag-protein 
of HTLV-III operably linked to a promoter capable 
of effecting the expression of said DNA sequence. 

27. A transformed cell carrying a vector as 
claimed in any one of claims 16 to 25. 

28. A transformed cell according to claim 27 
which is a yeast cell. 

29. A transformed cell according to claim 28 
which is a S. cerevisiae yeast cell. 

30. A transformed cell according to claim 29 
which is S. cerevisiae 20B-I2. 

31. A transformed yeast cell carrying a vector 
as claimed in claim 26. 

32. A polypeptide according to any one of 
claims I to 7 as constituent of a vaccine. 

33. A polypeptide according to any one of 
claims I to 7 as antigen. 

34. A process for producing a polypeptide as 
claimed in any one of claims I to 6 comprising: 
transforming a host cell with an expression vector 
as claimed in any one of claims 16 to 25; 

culturing said host cell so that the protein 
products are expressed; and, 

extracting and isolating said protein products. 

35. A process according to claim 34 wherein 
said host cell is a yeast cell 

36. A process according to claim 35 wherein 
said yeast cell is a S. cerevisiae ceil. 

37. A process for producing a polypeptide im- 
munologically equivalent to the gag-protein pro- 
ducts of HTLV-III comprising: transforming a yeast 
cell with an expression vector as claimed in claim 
26; 

culturing said yeast cell so that the protein 
products are expressed; and, 

extracting and isolating said protein products. 

38. A method of testing human blood for the 
presence of antibodies to the viral etiologic agent 
of AIDS which comprises mixing a composition 
containing a polypeptide as claimed in any one of 
claims I to 7 or mixtures thereof with a sample of 
human blood and determining whether said protein 
or any of its natural proteolytic proteins or mixtures 
thereof binds to AIDS antibodies present in the 
blood sample. 



39. Vaccines containing a polypeptide as 
claimed in any one of claims I to 7 and a phys- 
iologically acceptable carrier. 

40. Antibodies raised aqainst a polypeptid as 
s claimed in any one of claims I to 7. 

41. The antibodies of claim 40 which are mon- 
oclonal antibodies. 

42. The use of a polypeptide as claimed in any 
one of claims I to 7 for the preparation of a protec- 

70 tive immunization vaccine. 

43. The use of a polypeptide as claimed in any 
one of claims I to 7 for the preparation of anti- 
bodies against AIDS virus. 

44. The use of a polypeptide as claimed in any 
T5 one of claims I to 7 for testing human blood for the 

presence of AIDS virus. 

45. A test kit for the determination of antibodies 
against AIDS virus comprising in a container a 
polypeptide according to any one of claims I to 7. 

20 46. A test kit for the determination of AIDS 
virus comprising in a container antibodies against 
AIDS virus elicited by a polypeptide according to 
any one of claims I to 7. 

25 Claims for the following Contracting States : AT; ES 

1. A process for producing polypeptides im- 
munologically equivalent to the gag-protein pro- 
ducts of HTLV-III having the amino acid sequence 

30 given in Fig. 8 or Fig. 16, or I4kd, I6kd or 24kd 
proteolytic fragments thereof, which process com- 
prises: transforming a host cell with an expression 
vector comprising a gene coding for said gag- 
protein products operably linked to a promoter 

35 sequence enabling transcription, translation and ex- 
pression of said gag-protein products in said host 
cell; 

culturing said host cell so that the protein 
products are expressed; and, 
40 extracting and isolating said protein products. 

2. A process according to claim I, character- 
ized in that a host cell is transformed with an 
expression vector capable of expressing a polypep- 
tide with the amino acid sequence: 

45 MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAV- 
NPQLLETSEGCRQILG QLQPSLQTGSEELRSLYN- 
TVATLYCVHQRIEIKDTKEALDKIEEEQNKSK 
KKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQA- 
ISPRTLNAWVKWEEK AFSPEVIPMFSALSEGATP- 

50 QDLNTMLNTVGGHQAAMQMLKETINEEAAEW 

DRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQI- 
GWMTNNPPIPVGE1Y KRWIILGLNKIVRMYSPTSIL- 
DIRQGPKEPFRDYVDRFYKTLRAEQASQE 
VKNWMTETLLVQNANPDCKTILKALGPAATLEEM- 

55 MTACQGVGGPGHKARV LAEAMSQVTNTATIMM- 
QRGNFRNQRK1VKCFNCGKEGHIARNCRAPRKKG 
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CWKCGKEGHQMKDCTERQANFLGKIWPSYKGRP- 
GNFLQSRPEPTAPPFLQ SRPEPTAPPEESLRSG- 
VETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 

3. A process according to claim I, character- 
ized in that a host ceil is transformed with an s 
expression vector capable of expressing a polypep- 
tide with the amino add sequence: 
MFRWEKIRLRPGGKKKYKU<HIVWASRELERFAV- 
NPGLLETSEGCRQILG QLQPSLQTGSEELRSLYN- 
TVATLYCVHQRIEIKDTKEALDKIEEEQNKSK to 
KKAQQAAADTGHSSQVSQNY 

4. A process according to claim I, character- 
ized in that a host cell is transformed with an 
expression vector capable of expressing a polypep- 
tide with the amino acid sequence: 75 
MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAV- 
NPGLLETSEGCRQILG QLQPSLQTGSEELRSLYN- 
TVATLYCVHQRIEIKDTKEALDKIEEEQNKSK 
KKAQQAAADTGHSSQVSQNYPIVQNIQGQIvlVHQA- 
ISPRTLNAWVKWEEK AFS PEV1 P M FS ALS EGATP- 20 
QDLNTMLNTVGGHQAAMQMLXETINEEAAEW 
DRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQI- 
GWMTNNPPIPVGEIY KRWIILGLNKJVRMYSPTSIL- 
DIRQGPKEPFRDYVDRFYKTLRAEQASQE 
VKNW^ETLLVQNANPDCKTILKALGPAATLEEM- 25 
MTACQGVGGPGHKARV LAEAMSQVTNTATIMM- 
QRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKG 
CWKCG KEG HQMKDCTERQAN FLGKIHRTG WAM- 

IA 

5. A process according to claim I t character- 30 
ized in that a host cell is transformed with an 
expression vector capable of expressing a polypep- 
tide with the amino acid sequence: 
MQRGNFRNQRKIVKCFNCGKEGHIAflNCRAPRKK- 
GCWKCGKEGHQMKDCT ERQANFLGKIWPSYKG- 35 
RPGNFLQSRPEPTAPPFLQSRPEPTAPPEESLRS 
GVETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 

6. A process according to claim I, character- 
ized in that a host cell is transformed with an 
expression vector capable of expressing a polypep- 40 
tide with the amino acid sequence: 
PIVQNIQGQMVHQAISPRTLNAWWWEEKAFSPE- 
VIPMFSALSEGATPQ DLNTMLNTVGGHQAAMQM- 
LKETINEEAAEWDRVHPVHAGPIAPGQMREPR 
GSDIAGTTSTLQEQJGWMTNNPPIPVGEIYKRWHL- 46 
GLNKiVRMYSPTSI LDIRQGPKEPFRDYVDRFYKT- 
LRAEQASQEVK^4WMTETLLVQNANPDCKT ILKAL- 
GPAATLEEMMTACQGVGGPGHK 

7. A process according to any one of claims I 

to 6, characterized in that as a host cell a yeast cell so 
is used. 

8. A process according to claim 7, character- 
ized in that as a yeast cell a S. cerevisiae cell is 
used. 

9. A process for producing polypeptides im- 55 
munologically equivalent to the gag-protein pro- 
ducts of HTLV-III comprising: transforming a yeast 

cell with an expression vector comprising a gene 



coding for said gag-protein products operably 
linked to a promoter sequence enabling transcrip- 
tion, translation and expression of said gag-protein 
products in said yeast cell; 

cufturing said yeast cell so that the protein 
products are expressed; and, 

extracting and isolating said protein products. 

10. A process for the preparation of an expres- 
sion vector capable in a host cell of effecting the 
expression of a polypeptide as defined in any one 
of claims I to 6, which process comprises isolating 
a gene coding for said polypeptides and operably 
linking said gene with a promoter sequence. 

11. A process according to claim 10, character- 
ized in that a promoter sequence capable of effec- 
ting expression in a yeast cell is used. 

12. A process according to ciaim ll t character- 
ized in that a promoter sequence capable of effec- 
ting expression in S. cerevisiae cell is used. 

13. A process according to claim 12/ character- 
ized in that a PH05 promoter sequence is used. 

14. A process according to any one of claims 10 
to 13, characterized in that as a gene the Clal to 
EcoRI restriction size fragment of the XHXB-3 
genome is used. 

15. A process according to any one of claims 10 
to (3, characterized in that as a gene the Clal to 
BglH restriction site fragment of the Clal to EcoRI 
restriction site fragment of the XHXB-3 genome is 
used. 

16. A process according to any one of claims 10 
to 13, characterized in that as a gene the ClaJ to 
EcoRI restriction site fragment of XHXB-3, wherein 
the Clal to EcoRI fragment is filled in at the Bell 
restriction site with the base sequence GATC, is 
used. 

17. A process for the preparation of an expres- 
sion vector capable in a yeast cell of effecting the 
expression of polypeptides immunologically equiv- 
alent to the gag-protein products of HTLV-III, which 
process comprises isolating a gene coding for said 
polypeptides and operably linking said gene with a 
promoter sequence. 

18. A process for the preparation of a trans- 
formed cell carrying an expression vector capable 
in a host cell of effecting the expression of a 
polypeptide as defined in any one of claims I to 6, 
which process comprises transforming a host cell 
with said expression vector by methods known in 
the art 

19. A process according to claim 18, character- 
ized in that as a host cell a yeast cell is used. 

20. A process according to claim 19, character- 
ized in that as a yeast cell a S. cerevisiae cell is 
used. 

21 A process according to claim 20. character- 
ized in that as a S. cerevisiae cell S. cerevisiae 
20B-12 is used. 
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22. A process for the preparation of a trans- 
formed yeast cell carrying an expression vector 
capable in a host cell of effecting the expression of 
polypeptides immunologically equivalent to the 
gag-protein products of HTLV-III, .which process 5 
comprises transforming a yeast cell with said ex- 
pression vector by methods known in the art 

23. A process for testing human blood for the 
presence of antibodies to the viral etiologic agent 

of AIDS which method comprises mixing a com- 10 
position containing a polypeptide as defined in any 
one of claims I to 6 or 9 or mixtures thereof with a 
sample of human blood and determining whether 
said protein or any of its natural proteolytic proteins 
or mixtures thereof binds to AIDS antibodies is 
present In the blood sample. 

24. A process for the preparation of a vaccine 
comprising mixing a polypeptide as defined In any 
one of claims I to 6 or 9 with a physiologically 
acceptable carrier. 20 

25. A process for the preparation of antibodies 
against AIDS virus comprising injecting a mam- 
malin or avian animal with a sufficient amount of a 
polypeptide as defined in any one of claims I to 6 

or 9 and recovering said antibodies from the serum 25 
of said animals. 

26. Antibodies raised against a polypeptid as 
defined in any one of claims I to 6 or 9. 

27. The antibodies of claim 26 which are mon- 
oclonal antibodies. 30 

28. A polypeptide immunologically equivalent 
to the gag-protein products of HTLV-III whenever 
prepared by a process as claimed in any one of 
claims I to 9. 

29. A gene containing a gene portion having a 35 
DNA sequence encoding a polypeptide as defined 

in any one of claims I to 6 operably linked to a 
promoter capable of effecting the expression of 
said DNA sequence. 

30. The gene of claim 29 wherein said gene aq 
portion is the Clal to EcoRI restriction site fragment 

of the X HXB-3 genome. 

31. The gene of claim 30 wherein the promoter 
is the Bam HI to Aha III restriction site fragment of 
PH05. 45 

32. The gene of claim 29 wherein said gene 
portion is the Clal to Bglll restriction site fragment 
of the Clal to EcoRI fragment of the X HXB-3 
genome. 

33. The gene of claim 32 wherein the promoter so 
is the BamHI to Aha III restriction site fragment of 
PH05. 

34. The gene of claim 29 where the gene 
portion is the Clal to EcoRI restriction site fragment 

of X HXB-3. wherein the Clal to EcoRI fragment is 55 
filled in at its Bell restriction site with the base 
sequence GATC. 



35. The gene of claim 34 wherein the promoter 
is the BamHI to Ahalll restriction site fragment of 
PH05. 

36. A gene containing a gene portion having a 
DNA sequence encoding the gag-protein products 
of HTLV-III operably linked to a promoter capable 
of effecting the expression of said DNA sequence 
in yeast. 

37. A recombinant expression vector capable 
of effecting the expression of a polypeptide as 
defined in any one of claims 1 to 6 containing a 
gene portion having a DNA sequence encoding 
said polypeptide operably linked to a promoter 
capable of effecting the expression of said DNA 
sequence. 

38. A vector according to claim 37 wherein the 
gene portion is the Clal to EcoRI restriction frag- 
ment of the X HXB-3 genome. 

39. A vector according to claim 38 wherein the 
promoter is the BamHI to Ahalll restriction site 
fragment of PH05. 

40. A vector according to claim 37 wherein the 
gene portion is the Clal to Bglll restriction site 
fragment of the Cla I to EcoRI restriction site frag- 
ment of the X HXB-3 genome. 

41. A vector according to claim 40 wherein the 
promoter is the BamHI to Ahalll restriction site 
fragment of PH05. 

42. A vector according to claim 37 wherein the 
gene portion is the Clal to EcoRI restriction site 
fragment of X HXB-3, wherein the Clal to EcoRI 
fragment is filled in at the Bell restriction site with 
the base sequence GATC. 

43. A vector according to claim 42 wherein the 
promoter is the BamHI to Ahalll restriction site 
fragment of PH05. 

44. A vector according to claim 39 which is 
pYE72/gagi. 

45. A vector according to claim 41 which is 
pYE72/gag2. 

46. A vector according to claim 43 which is 
pYE72/gag3. 

47. A recombinant expression vector capable 
of effecting the expression of the gag-protein pro- 
ducts of HTLV-III in yeast containing a gene portion 
having a DNA sequence encoding the gag-protein 
of HTLV-III operably linked to a promoter capable 
of effecting the expression of said DNA sequence. 

48. A transformed cell carrying a vector as 
claimed in any one of claims 37 to 46. 

49. A transformed cell according to claim 48 
which is a yeast cell. 

50. A transformed cell according to claim 49 
which is a S. cereviviae yeast cell. 

51. A transformed cell according to claim 50 
which is S. cerevisiae 20B-I2. 
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52. The use of a polypeptide as defined in any 
one of claims I to 6 or 9 for the preparation of a 
protective immunization vaccine. 

53. The use of a polypeptide as defined in any 

one of claims I to 6 or 9 for the preparation of s 
antibodies against AIDS virus. 

54. The use of a polypeptide as defined in any 
one of claims I to 6 or 9 for testing human blood 
for the presence of AIDS virus. 

55. A test kit for the determination of antibodies ro 
against AIDS virus comprising in a container a 
polypeptide as defined in any one of claims I to 6 

or 9. 

56. A test kit for the determination of AIDS 
virus comprising in a container antibodies against 75 
AIDS virus elicited by a polypeptide as defined in 

any one of claims I to 6 or 9. 
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Figure 5 (cont.) 
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Figure 6-A 
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Figure 6-B 
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Figure 7 



GAG 56 SEQ 

ATGTTTCGATGGGAAAAAATTCGGTTAAGGCCAGGG6GAAAGAAAAAATATAAATTAAAACA 
TATAGTATGGGCAAGCAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACAT 
CAGAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAAGAA 
CTTAGATCATTATATAATACAGTAGCAACCCTCTATTGTGTGCATCAAAGGATAGAGATAAA 
AGACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAAGCAC 
AGCAAGCAGCAGCTGACACAGGACACAGCAGTCAGGTCAGCCAAAATTACCCTATAGTGCAG 
AACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGGGTAAA 
AGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTATCAGAAG 
GAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAAGCAGCCATG 
CAAATGTTAAAAGAGACCATCAATGAGGAAGCTGCAGAATGGGATAGAGTACATCCAGTGCA 
TGCAGGGCCTATTGCACCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCAGGAACTA 
CTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCTATCCCAGTAGGAGAA 
ATTTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGAATGTATAGCCCTACCAG 
CATTCTGGACATAAGACAAGGACCAAAAGAACCCTTTAGAGACTATGTAGACCGGTTCTATA 
AAACTCTAAGAGCCGAGCAAGCTTCACAGGAGGTAAAAAATTGGATGACAGAAACCTTGTTG 
GTCCAAAATGCGAACCCAGATTGTAAGACTATTTTAAAAGCATTGGGACCAGCAGCTACACT 
AGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGGACCCGGCCATAAGGCAAGAGTTTTGG 
CTGAAGCAATGAGCCAAGTAACAAATACAGCTACCATAATGATGCAGAGAGGCAATTTTAGG 
AACCAAAGAAAGATTGTTAAGTGTTTCAATTGTGGCAAAGAAGGGCACATAGCCAGAAATTG 
CAGGGCCCCTAGGAAAAAGGGCTGTTGGAAATGTGGAAAGGAAGGACACCAAATGAAAGATT 
GTACTGAGAGACAG6CTAATTTTTTAGGGAAGATCTGGCCTTCCTACAAGGGAAGGCCAGGG 
AATTTTCTTCAGAGCAGACCAGAGCCAACAGCCCCACCATTTCTTCAGAGCAGACCAGAGCC 
AACAGCCCCACCAGAAGAGAGCCTCAGGTCTGGGGTAGAGACAACAACTCCCTCTCAGAAGC 
AGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTGGCAACGAC 
CCCTCGTCACAATAA 
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FIGURE 8 

GAG56.PEP 

P56 

MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILG 
QLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSK 
KKAQQAAADTGHSSQVSQNYPIVQNIQGQMVHQAISPRTLNAWVKWEEK 
AFSPEVIPMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKETINEEAAEW 
DRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIY 
KRWIILGLNKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQE 
VKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARV 
LAEAMSQVTNTATIMMQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKG 
CWKCGKEGHQMKDCTERQANFLGKIWPSYKGRPGNFLQSRPEPTAPPFLQ 
SRPEPTAPPEESLRSGVETTTPSQKQEPIDKEIYPLTSLRSLFGNDPSSQ 
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FISURF 9 

GAG24.SEQ 

CCTATAGTGCAGAACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAA 
I5?5IS55I AA M GTAGTAGAAGA6AAGGCTTTCAGC CCAGAAGTAATACCCATGTTTTCAG 
KIISK^ A 6 GGAGCCACCCCACAAGATTTAAACA CCATGCTAAACACAGTGGGGGGACAT 
CAAGCAGCCATGCAAATGTTAAAAGAGACCATCAATGAGGAAGGTGCAGAATGGGATAGAGT 
J?JIKi5I5J A ISK^ GCCTATTGCACCAGGCCAGAT SAGAGAACCAAGGGGAAGTGACA 
JfSKKJ A £I A K^T^ CCCTTCAGGAACAAATAGGA TGGATGACAAATAATCCACCTATC 
fCAGTAGGAGAAATTTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGAATGTA 
I AGGGG IACCAGCATTCTGGACATAAGACAAGGACCAAAAGAACCCTTTAGAGACTATGTAG 
?S^SII?J$T^?£I9T A 5 GAGCCGAGCAAGCTTCACAGGAGGT AAAAAATTGGATGACA 
GAAACCTTGTTGGTCCAAAATGCGAACCCAGATTGTAAGACTATTTTAAAAGCATTGGGACC 
AGCAGCTACACTAGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGGACCCGGCCATAAG 



0 230 222 



FIGURE 10 

GAG24.PEP 

P24 

PIVQNIQGQMVHQAISPRTLNAWVKWEEKAFSPEVIPMFSALSEGATPQ 
DLNTMLNTVGGHQAAMQMLKETINEEAAEWDRVHPVHAGPIAPGQMREPR 
GSDIAGTTSTLQEQIGWMTNNPPIPVGEIYKRWIILGLNKIVRMYSPTSI 
IDIRQGPKEPFRDYVDRFYKTLRAEQASQEVKNWMTETLLVQNANPDCKT 
ILKALGPAATLEEMMTACQGVGGPGHK 
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FIGURE 11, 

GAG16.SE0 

ATGCAGAGAGGCAATTTTAG6AACCAAAGAAAGATTGTTAA6T6TTTCAATT6TGGCAAAGA 
AGGGCACATAGCCAGAAATTGCAGGGCCCCTAGGAAAAAGGGCTGTTGGAAATGTGGAAAGG 
AAGGACACCAAATGAAAGATTGTACTGAGAGACAGGCTAATTTTTTAGGGAAGATCTGGCCT 
TCCTACAAGGGAAGGCCAGGGAATTTTCTTCAGAGCAGACCAGAGCCAACAGCCCCACCATT 
TCTTCAGAGCAGACCAGAGCCAACAGCCCCACCAGAAGAGAGCCTCAGGTCTGGGGTAGAGA 
CAACAACTCCCTCTCAGAAGCAGGAGCCGATAGACAAGGAACTGTATCCTTTAACTTCCCTC 
AGATCACTCTTTGGCAACGACCCCTCGTCACAA 
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FlGURF 17 

GA616.PEP 

P16 

MQRGNFRNQRKIVKCFNCGKEGHIARIOAPRKKGCWKCGKEGHQMKDCT 
ERQANFLGKIWPSYKGRPGNFLQSRPEPTAPPFLQSRPEPTAPPEESLRS 
GVETTTPSQKQEPIDKELYPLTSLRSLFGNDPSSQ 
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FI6URF 15 

GAG14.SEQ 

JISIIKSJIK5?5WWII^ST TAAGGCCAGGGGGAAAGAAAAA ATATAAATTAAAACA 

TATAGTATGGGCAA6CAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCT6TTAGAAACAT 

CAGAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAAGAA 

CTTAGATCATTATATAATACAGTAGCAACCCTCTATTGTGTGCATCAAAGGATAGAGATAAA 

AGACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGC 

AGCAAGCAGCAGCTGACACAGGACACAGCAGTCAGGTCAGCCAAAATTAC 
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FIGURE 1H 

GAG14.PEP 

?m 

MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILG 
QLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSK 
KKAQQAAADTGHSSQVSQNY 
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FIGURE 15 

GAG48.SEQ 

ATGTTTCGATGGGAAAAAATTCGGTTAAGGCCAGGGGGAAAGAAAAAATATAAATTAAAACA 
TATAGTATGGGCAAGCAGGGAGCTAGAACGATTCGCAGTTAATCCTGGCCTGTTAGAAACAT 
CAGAAGGCTGTAGACAAATACTGGGACAGCTACAACCATCCCTTCAGACAGGATCAGAAGAA 
CTTAGATCATTATATAATACAGTAGCAACCCTCTATTGTGTGCATCAAAGGATAGAGATAAA 
AGACACCAAGGAAGCTTTAGACAAGATAGAGGAAGAGCAAAACAAAAGTAAGAAAAAAGCAC 
AGCAAGCAGCAGCTGACACAGGACACAGCAGTCAGGTCAGCCAAAATTACCCTATAGTGCAG 
AACATCCAGGGGCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGGGTAAA 
AGTAGTAGAAGAGAAGGCTTTCAGCCCAGAAGTAATACCCATGTTTTCAGCATTATCAGAAG 
GAGCCACCCCACAAGATTTAAACACCATGCTAAACACAGTGGGGGGACATCAAGCAGCCATG 
CAAATGTTAAAAGAGACCATCAATGAGGAAGCTGCAGAATGGGATAGAGTACATCCAGTGCA 
TGCAGGGCCTATTGCACCAGGCCAGATGAGAGAACCAAGGGGAAGTGACATAGCAGGAACTA 
CTAGTACCCTTCAGGAACAAATAGGATGGATGACAAATAATCCACCTATCCCAGTAGGAGAA 
ATTTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGAATGTATAGCCCTACCAG 
CATTCTGGACATAAGACAAGGACCAAAAGAACCCTTTAGAGACTATGTAGACCGGTTCTATA 
AAACTCTAAGAGCCGAGCAAGCTTCACAGGAGGTAAAAAATTGGATGACAGAAACCTTGTTG 
GTCCAAAATGCGAACeCAGATTGTAAGACTATTTTAAAAGCATTGGGACCAGCAGCTACACT 
AGAAGAAATGATGACAGCATGTCAGGGAGTAGGAGGACCCGGCCATAAGGCAAGAGTTTTGG 
CTGAAGCAATGAGCCAAGTAACAAATACAGCTACCATAATGATGCAGAGAGGCAATTTTAGG 
AACCAAAGAAAGATTGTTAAGTGTTTCAATTGTGGCAAAGAAGGGCACATAGCCAGAAATTG 
CAGGGCCCCTAGGAAAAAGGGCTGTTGGAAATGTGGAAAGGAAGGACACCAAATGAAAGATT 
GTACTGAGAGACAGGCTAATTTTTTAGGGAAGATCCACAGGACGGGTGTGGTCGCCATGATC 
GCGTAG 
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FIGURE 16 

GAG48.PEP 

P48 

MFRWEKIRLRPGGKKKYKLKHIVWASRELERFAVNPGLLETSEGCRQILG 
QLQPSLQTGSEELRSLYNTVATLYCVHQRIEIKDTKEALDKIEEEQNKSK 
KKAQQAAADTGHSSQVSQNYP I VQN IQGQMVHQA I SPRTLNAWVK WEEK 
AFSPEVI PMFSALSEGATPQDLNTMLNTVGGHQAAMQMLKET I NEEAAEW 
DRVHPVHAGPIAPGQMREPRGSDIAGTTSTLQEQIGWMTNNPPIPVGEIY 
KRWIILGLNKIVRMYSPTSILDIRQGPKEPFRDYVDRFYKTLRAEQASQE 
VKNWMTETLLVQNANPDCKTILKALGPAATLEEMMTACQGVGGPGHKARV 
LAEAMSQVTNTATIMMQRGNFRNQRKIVKCFNCGKEGHIARNCRAPRKKG 
CWKCGKEGHQMKDCTERQANFLGKIHRTGVVAMIA 
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